# TRL Fine-tuning
Qwen3 8B Grpo Medmcqa
A fine-tuned version based on Qwen/Qwen3-8B using the medmcqa-grpo dataset, specialized in medical multiple-choice question answering tasks
Large Language Model
Transformers

Q
mlxha
84
1
Deepseek R1 Chinese Law
Apache-2.0
Llama model trained with Unsloth and Huggingface TRL library, achieving 2x faster inference speed
Large Language Model
Transformers English

D
corn6
74
2
Travelbot
Apache-2.0
Llama model trained with Unsloth and Huggingface TRL library, achieving 2x inference speed improvement
Large Language Model
Transformers English

T
kitty528
9,146
2
Featured Recommended AI Models